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Abstract. We study a class of coalescents derived from a sampling procedure 
out of TV i.i.d. Pareto(a) random variables, normalized by their sum, includ- 
ing /3— size-biasing on total length effects (/3 < a). Depending on the range 
of a, we derive the large N limit coalescents structure, leading either to a 
discrete-time Poisson-Dirichlet(«, — /3) S— coalescent {a G [0, 1)), or to a fam- 
ily of continuous-time Beta(2 — q, a — /3) A— coalescents {a g [1, 2)), or to the 
Kingman coalescent (o > 2). We indicate that this class of coalescent processes 
(and their scaling limits) may be viewed as the genealogical processes of some 
forward in time evolving branching population models including selection ef- 
fects. In such constant-size population models, the reproduction step, which is 
based on a fitness-dependent Poisson Point Process with scaling power-law(Q) 
intensity, is coupled to a selection step consisting of sorting out the A'^ fittest 
individuals issued from the reproduction step. 

Running title: Pareto genealogies in a Poisson evolution model with se- 
lection. 

Keywords: Pareto coalescents, scaling limits, Poisson-Dirichlet, Kingman 
and Beta coalescents, Poisson Point Process, evolution model including selec- 
tion. 



1. Introduction 

We first investigate discrete-time finite coalescents derived from sampling from N 
i.i.d. Pareto(a) random variables, normalized by their sum, with a > 0. We 
include size-biasing on total length effects involving a parameter /3 < a. Depending 
on the range of a, we derive the large N limit coalescents structure: The case 
a G [0, 1) leads to a discrete-time Poisson-Dirichlet (a, —(3) S— coalescent (with 
no time-scaling). The case a — 1 gives rise to a continuous-time beta(l, 1 — (3) 
A— coalescent, involving a logarithmic time scaling log A^. The case a G (1, 2) leads 
to a continuous-time Beta(2 — a, a — /?) A— coalescent, involving a power-law time 
scaling according to N°'~^. The range a > 2 gives rise to the standard Kingman 
coalescent (with time scaling if a > 2 and N/ log in the critical case a = 2). 
We give for each case the exact speeds of convergence. We establish a loose link with 
a Generalized Central Limit Theorem for stable random variables and we briefly 
recall the main statistical features akin to general A— coalescents. 

We indicate that the above special classes of coalescent processes (and their scaling 
limits) may be viewed as the genealogical processes of some forward in time evolving 
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branching population models including selection effects. These models are closely 
related in spirit to the additive exponential model discussed in Brunet et al [6] , [7] . 

In the models we first consider, the size N of the population is kept constant over the 
generations. Each alive individual is assigned some positive fitness a; > 0. In each 
generation and for each of the N offspring alive independently, the reproduction 
step is based on a fitness-dependent Poisson Point Process (PPP) with scaling 
power-law(Q;) intensity; this procedure assigns new fitnesses to the (potentially 
infinitely many) individuals of the next generation, in a multiplicative way. We call 
/ (x) = the output fitness of an individual with fitness x. The selection step 
then consists of sorting out the N fittest individuals issued from the reproduction 
stejE. The process is iterated independently over the subsequent generations. To 
the first large A'^ approximation, the logarithm of the mean output fitness within 
each generation k, scaled by the generation number k, is shown to shift to the right 
at speed vn = log log TV as fc — )■ oo. 

While adopting a sampling point of view based on the intensity of the PPP to 
compute the coalescence probabilities that some offspring is the one of a parental 
individual with given fitness, it is shown that the genealogy of the branching model 
with selection is in the domain of attraction of the beta(l, 1 — /?) coalescent in the 
large N limit (Bolthausen-Sznitman coalescent if /3 = 0). It is also shown that the 
full class of the Pareto-coalescents discussed earlier can be obtained while consid- 
ering the PPP which is the output image of the original one, given by the output 
map / {x) = x" in fitness space. In this setup, the large N limit computations of 
the coalescence and merging probabilities are based not on the fitnesses but on the 
output deformed fitnesses. 



2.1. Pareto sampling and coalescents. Let Xi,..,Xn be N i.i.d. Pareto(Q;) 
random variables (rvs) with P {Xi > x) — x^", a > and a; > 1. Let Fxi {x) — 
1 — P (Xi > x) denote its probability distribution function (pdf ) . The density of 
^1 is fxi {x) — ax~^°'~^^\ Let Sn ^nf^i ^n, n — 1,---,N define a random 
partition of the unit interval, upon normalizing the Xs by their sum. The SnS 
are identically distributed but not independent of course as they sum up to 1; by 
doing so, the unit interval [0, 1] is thus broken into N random pieces (subintervals 
or segments) of sizes Sn, n — 1, N. 

By sampling the Ss, we mean that we draw independently i uniform random vari- 
ables on the unit interval with i < N, looking at the subintervals which are being hit 
in the process. From this procedure, for instance, the probability that the i— sample 
hits any one of the S'„s only once is 



This particular way of introducing selection in a randomly evolving branching population with 
constant poulation size seems to appear first in [5]. It has nothing to do with the way selection is 
classically introduced in population genetics; see | 28| . |14| and [8]. 



2. Coalescents derived from Pareto-Sampling 
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Let TiN '■— '^n=i-^n denote the partial sum of the Xs. For the values of /3 for 
which E (^Af) exists, we can size-bias the latter probability by the total length Ejv 
and consider insteacQ 



E 



E E E 



The latter event under consideration corresponds to an i to 1 merger of some Markov 
coalescent process where i particles are identified to a single one (share the same 
ancestor) whenever the i— sample hits the same subinterval of the unit partition. 
In this setup, -P/^'' is therefore the entry (i, 1) of its one-step transition matrix. 

The quantity cn '■— P2^\ which is the probability that two individuals chosen 
at random out of N share the same common ancestor, is called the coalescence 
probability. 

Similarly we can define a i to j merger (j < i) by considering the event that the i 
particles hit any size— j subset of the segments S constituting the partition of unity. 
We get 

(^) ± (,:..>(ri*.") ^ 

^ ii+...+ij=i ^ ^ \i=i J 



)E(-ir'(;)E((^i + -+^/r 



where the star-sum in ([2|) runs over the ii > 1. The quantity E ^H/^i ^i'^ '^^ ^^e 
probability of a ij) —merger from i to j. Using the same abuse of notation, 

we shall also write the size-biased version of the latter probability as 



Unless stated otherwise, whenever we speak in the sequel of P^'^'^ , we mean ^ and 
not ^. 

Clearly, X]j=i = 1 '^^'^ ^'^ ^^le matrix p(^) with entries pI^\ i — 

and j = is a TV x TV lower-triangular stochastic matrix corresponding to 

some Markov discrete-time-A: coalescent (pure death) process, say x^^\ with finite 
state-spac^l and state {1} absorbing. Let us first investigate the expression of the 
size-biased probability Pj;^\ showing that its large N estimate depends on the 
understanding of the /?— moments of E^r. 



^We abusively use the same notation j in the size-biased setup as in JTJ (corresponding to 
/3 = 0), to avoid overburden notations. 

■^A 'true' coalescent process takes values in the set of equivalence relations or partitions on 
{1, ...yN} and we rather deal here and throughout with its block-counting counterpart. 
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Proposition 1. When —00 < /3 < a < 2, the size-biased probability of an i to 1 
merger reads 



Proof: We have 1/Si = 1 + S^^^/Xi where := J^^^^^ X„ ^ ^^'^j/ X„ 

X^TV-i. By conditioning on Xi, with f-^N-i density of XItv-i, we get 

E (S^ j ^1 

Reversing the integration and after two changes of variables 



Na 



(4) 



E , 

E 
E 



dz ■ zf"-' {z - If 



/ sP-y^,_As)ds 
Jn-1 



When /3 < a < 2, the latter integral is (upon adequate normalization by a beta 
function term i? (2 — a, a — /?)) identified with the order i — 2 moment of a beta(2 — 
a, a — (5) rv. In this parameter range, for i > 2 

^ 1-2 i-af. -.a-is-i, r{i-a)T{a~ (3) 
u u (1 — u) du ^ — ~:Bn — a,a — p) 

r(«-p) 

are well-defined, o 

The Pareto rv Xi has power-law tails with index a. Therefore E (^^f ) only exists 

when /3 < a, with E (^^f ) — a/ {a ~ /3) . Clearly also, the tails of the sum Ejv obey 
P(SAr>s) - iVP(Xi > s) = iVs-". Because A^P(Xi > s) - P {Mn > s) = 

1 — (1 — s^")^ where — max (Xi, .., X^), this means that for large s, the event 
Sat > s is essentially determined by the event Mn > s. Note that, as a result, Ejv 
has the same tail index as Xi, indicating that the /?— moment of S^r only exists for 
P < a. 

We will show below from this, that large N estimates of E {^n^ can be obtained. 

As a result, for example, based on ([4]), whenever 1 < a < 2 and /3 < a, it will be 
checked that 



E 



N- 
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We will show that, in that case, for each i > 2, the limits limjv_j.oo Cp/P^'^^ exist, 
with 

c-ipW ^ I f\^-^A(du)>0 

""^^^^^ N^ooB{2-a,a-f3) T {z - f3) " ^ ^W>U, 

where A (du) has the density on [0, 1]: u^^" (1 — u)" ^ ^ / B {2 — a,a — j3), which 
is a beta(2 — a,a — P) density. 

Because in the large N limit, simultaneous multiple collisions will be seen to be 
negligible, we conclude, using similar arguments to the ones in ^36j and 30i, that, 
in the range 1 < a < 2 and (3 < a, a time-scaled version of the finite discrete-time- 
k coalescent a:^^'' arising from size-biased sampling out of Pareto(Q;) partitions 
converges weakly (as N oo) to a continuous-time-i A— coalescent xt where A is 

a beta(2 — a,a ~ /3) probability measure on [0, 1] ; namely xt = limAr^oo 
In other words, if t is continuous-time, the appropriate scaling is A: [t/cAr] (the 
integral part of t/cN), showing that time t should be measured in units of :— 
(the effective population size). The limiting process xt is a continuous-time pure 
death Markov process on N = {1,2,...}, absorbed at state {1} and with transition 
rates from i to j given by 



The rate terms Xij may also be written as 



^(-1)7 * 



n \i -I 



qi+i-j+i, 



where qi := ^A (du) are the /—moments of u ^A (du) and also the rates of I 

to 1 mergers A;_i. 



2.2. Generalities on A— coalescents. Whenever one gets a family of rates Xij 
as above for some finite probability measure A on [0, 1] , one speaks of continuous- 
time-t A— coalescents (see [33] for a precise definition). These A— coalescents are 
non- increasing pure death Markov processes, say Xi, on the state-space N, and the 
Xi.jS are the rates at which i to j < i mergers for xt occur. The state {1} is 
absorbing. In such processes, multiple collisions of any order (when 1 < j < i) 
can occur, but never simultaneously. The total death rate at which some merger 
occurs, starting from state i, is \i :— Y^]Ji Xij. One can check that when A = (5o 
(corresponding to the Kingman coalescent), Xij =/= only when j = i — 1 with 
Xi = Ai_i_i — (2). The general expression of A^ is 

Xi= [ u^^ (1 - (1 - -iu{l- uY^^) A {du) . 

"'[0,1] ^ ^ 



Here, because two processes are involved, the symbol = means convergence of all the finite- 
dimensional distributions of xf,^' , , t > to the ones of xt, t > 0. 
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When A ({0}) — (excluding thereby the Kingman coalescent), the precise dynam- 
ics of xt when started at i is given by xo = « and [3] 

(5) xt-xo^- (B(xs_,u)-lg(^ \^AAf{dsxdu) 

J{0,t]x(0,l] ^ V ; / 

= -/ {B{xs_,u)-1) Af{dsxdu). 

Here, x+ = max (x, 0), A/" is a random Poisson measure on [0, oo) x (0, 1] with inten- 
sity ds X (du) and B (^Xs_ , uj hm(^Xs_ ,u) is a binomial rv with parameters 
[xs_ , u) . As a result, with 

(6) r{i):= (ui-l + {l-uy)u-^A{du), i> 0, 

J{0,1] ^ ' 

upon taking the expectation in ([5]), it holds that 

E [dxt \xt_) = -r {xt_) dt. 

From this, the quantity r (i) is the rate at which size i blocks are being lost as time 
passes by. Clearly r is also 

i-1 i-1 

r (i) = i - ^ jX^J ^^{i- j) ■ 

Consequently, the reciprocal function l/r (i) of the rate r (i) interprets as the ex- 
pected time spent by xt in a state with i lineages and therefore I/'' (j) ^il^ 
give a large i estimate of the expected time to the most recent common ancestor 
(the height of the coalescent tree): 

Ti.i :— inf {t G M_|_ : Xt — I \ xq = i) . 

There are lots of detailed studies in the literature (see a precise partial list below) 
on other functionals of xt such as the total branch length Li of the A— coalescent, 
its total external branch length Lf, the length li of its external branch (the time 
till first collision of a branch chosen at random out of i), the number of collisions 
Ci till time to most recent common ancestor,... 



All these functionals obey some distributional identities which prove useful to obtain 
some insight on their limit laws as « — >■ oo. Given xq = i, all involve the number 
Ui of singletons taking part in the first collision occurring at time Ti ~ exp(Ai) , 
giving XTi — 1 = i — Ui singletons not participating to the first collision (with T^, XTi 
independent). The quantity Ui is important in itself because, due to P (Ui = j) = 

K,i-j+i/K, j = 2, .., z 

r(z)=A, (E([/0-l) 

where 

„ N i f 1 — u — (1 — uY . , , , 

E m = V / , A (du) . 

^^iJ {0,1] u{l-u) 

Provided A has no atom at point {!}, the condition J2iL2^/^i'^) < cjo is the 
necessary and sufficient condition for xt to come down from infinity, |38j . 
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Clearly indeetfl, = Ti + r^j^^,!, Li = iTi + L^^,, LI = iTi + i|^^_i, d = 1 + Cxj.. 

and ^ Ti + Bilx^.-i where Bi is a Bernoulli rv, given by: P {Bi = 1 | xt^) = 
(xTi — 1) /i with BilxT^.-i independent of Ti. 

Famous examples that we shall deal with in the sequel, include A— coalescents for 
which: 

- A {du) = B (a, h) u"^^ (1 — u)^^^ ("") du, with a, 6 > and with B (a, 6) the 
beta function: we get the beta(a, b) coalescents. 

- A {du) = B{2-a,a) u^~" (1 - u)""^ l[o,i] {u) du, {a G (0, 1) U (1, 2)); this is the 
beta(2 — a, a) coalescent. 

- (Lebesgue) A{du) = l[o,i] (u) du : this is the Bolthausen-Sznitman coalescent or 
beta(l,l) coalescent. 

- A (du) ~ So : we get the Kingman coalescent where only binary mergers can occur 
{j ~ i — 1) one at a time, [26]. 

- A (du) — 6i : we get the star-shaped coalescent involving a single big collision, o 

Using the above distributional identities, it can be shown for instance that, for the 
Kingman coalescent and for large i, to leading order of magnitude, rough estimates 
are: Tj,i - 2 (1 - l/i) gO], ~ 21ogi [12], Lf ~ 2 [22], ft = i - 1 and ~ 1/i [9]. 

For the Bolthausen-Sznitman coalescent: t^ i ^ loglogi Tf], Li ^ i/logi [12], 
Ci ^i/logi [21] and h ^ 1/logi 15 . 

For the beta(2 — a, a) coalescent with < a < 1, Li ^ i [29] and Li/ LI converges 
in probability to 1 [31] and k ^ O (1) 16J. 

For the beta(2 - a, a) coalescent with 1 < a < 2, - l/i"-'^ [24j, Lf - l/i"-^ 
[11] and k - l/i"-i [To]. 

This information is important to grasp the general shape of the coalescent trees in 
each case. 

3. GCLT FOR Pareto sums 

In this Section, wc first sketch a loose connection of the large N estimate of cn 
with the Generalized Central Limit Theorem for random variables in the domain 
of attraction of stable laws, [41 . 

Let Ejv — '^n=i -^n be the partial sums of the i.i.d. Xs with Pareto(a) distribu- 
tions, a > 0, on (1, oo). Let Sa be skewed a— stable rvs with 

E {e'^^") = e-^" if a e (0, 1) , A > 



— means equality in distribution between random variables. 
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the Laplace- Stieltjes transform (LST) of a one-sided a— stable rv, 

E (e'^-^i) = g-|A|(l+^sign(A)f log|A|) 

the characteristic function (c.f.) of a skewed 1— stable Cauchy rv on R 

E(e*^^°) = e-l^l°(i-''"sn(A)ta„(^)) if Q, e (1^2), A e M 
the characteristic function (c.f.) of a skewed a— stable Cauchy rv on M and 

E (e'^^") = e-^'/2 if a > 2, A G R 
the c.f. of a standard normal rv on R. 

The following Generalized Central Limit Theorem (GCLT) then holds 

Theorem 2. i^|41). |42j ).• Let Ejv denote the partial sum sequence of N i.i.d. 
Pareto{a) random variables. Then, 

/-^ ^AT ^ 0.N d o 

where, with 

Co. - (r(l-a)cos(^))'^" */ae(0,2)\{l}, 




\a — 2 

bpf is given by 

(i) bN = CaN'^''''^^^°''^/^Uf a ^ 2 
[a) bM = {NlogNf^^ ifa^2 
and, with 7 the Euler constant and E {Xi) = /i := a/ {a ~ 1), qm is given by 

(i) aN = Oifae (0, 1) 

Nn'^ f°° /2x\ ( 2\ 

(ii) aN = \ sin( — jdFxi(a;)-A^logiV + Af (1 -7-log-j i/a = l 

{iii) aN — 7V/i i/ a G (1, 00) . 

When a < 1, the characteristic values of E^v can be guessed to be what they are 
claimed to be (iV^/" if a < 1 and N log if a = 1) while estimating N /™" xfxi {x) dx 
where niN is the mode of Mn which is seen to grow like iV^/" (see [4 ) s-i^"^ sim- 
ilarly for the fluctuation scaling term ^at for a < 2 (of order 7V^/" if a < 2 and 
(iVlogA^)^^^ if a = 2). 

From these rough estimates, we would conclude that with cn oc N , up to 

the leading order in N 

(C TVV")^"" 
Cat cx '—— ==0(1) if a e (0, 1) 
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(mogNf^ 1 

Cat oc A' ^ ^ — it a = 1 

(iVlogiV)'^ logiV 

oc N ^^^^\ - ^-"iV-("-i) if a e (1, 2) . 
Depending on the values of a, we therefore anticipate 

• a e (0, 1): Because in that case cat is asymptotic to a constant, this suggests a 
limiting discrete-time coalescent. We will show below that it is not a discrete-time 
A— coalescent, rather it is a S— coalescent of the Poisson-Dirichlet type with two 
parameters (a, — /3). 

S— coalescents were first introduced in [30] and further studied in [S^- In sharp 
contrast with A— coalescents, multiple collisions can occur simultaneously at the 
same transition time. In their block-counting version, they are characterized by the 
set of numbers 0^ {ii, defining the probabilities of an (zi, —merger (ii -|- 
.. + ij — i), resulting when the 5— coalescent is discrete, in an i to j < i transition 
with probability P^j = jj Eli+...+i,=i --'^j)- '^^j 

written as 



J n^'' '^Aj{dui,..,duj) 



for some finite measures Aj with density on the (j + 1) — simplex 



A, 



|(-iti, e [0, 1]^ : ui + ...+Uj < l| 



The set of measures Aj, j > 1, (characterized by their moments (j)j), with values 
over the simplices A^ , completely characterize the S— coalescent, |30| . 

In the simplest cases, with {u,u) := X];>i it is also convenient (see [37] ) to 



rewrite the (j)jS as 



all distinct 



where A = (ui, .., ui, ..): ui >..> ui >..>{) : X]/>i !i l| ^-nd ^ a finite 

measure concentrated on the subset A* of A consisting of those us exactly summing 
to 1. Letting v (du) := 'E.{du) / {u,u) , the measure v on the infinite simplex A is 
such that v (A) < oo. 



• a = 1: A A— coalescent with logarithmic effective population size where A is a 
beta(l, I — /3) probability measure with /3 < 1 as in f7| (reducing to the Bolthausen- 
Sznitnian coalescent when avoiding size-biasing corresponding to /? = 0). 

• a G (1,2): A A— coalescent with Ne cx N"^^ where A is a beta(2 — a,a — /3) 
measure with (3 < a. In the latter case, /3 = leads to the standard beta(2 — a, a) 
coalescent. 
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• a > 2 : In the range a > 2, (respectively a = 2), A^e oc iV (respectively 
A^e oc N/logN) and the obtained scaled limiting coalescent is a Kingman coa- 
lescent, as a result of A approaching 5q. In this latter case, only binary mergers 
occur in the large A'^ limit. 



Remark: The Kingman coalescent also occurs when dealing with sampling from 
some alternative random partition. For instance, would sampling be defined from 
a random partition of unity given by 5„ := X„/^^ Ar„, n = 1, A^ where Xi 
now obeys the following gamma(6') density: fx^ (x) = T {6}^^ x^^^e~^ , 9,x > 0, 
then the law of Sat = Xi + ...+Xn is /e„ i^) ^ T (NOy'^ x'^^'-^e-'' , independent 
of Si and 



^1 = ^7 = 7 V^^NEfSl) 



^r(7v^jMi+^^^-(,-i)r(^ 



r(e') T{N9 + i) T{0) 

Thus CN = P^V = ^ together with dj, = P^^^ = Be- 

cause triple mergers are asymptotically negligible compared to binary ones {(In/cn — > 
0), the time-scaled limiting coalescent using cat — j;^-^ oc A^^^ is a Kingman 
coalescent. The prefactor appearing in front of cat is the ratio p/iJ,^ where 
/9 := E (Xf) ^e{e + l) and ^ := E (Xi) = 9 and cjv is independent of fi. 

4. Large A^ asymptotic estimation of E (^^n^ and consequences 

In this Section, we compute the asymptotic behavior of the /3— moments of Ejv 
in the cases a S (0, 1) , a G (1:2) and a = 1, and a > 2, making the previous 
conclusions based on the GCLT consisteni0. 

• We start with the case a S (0, 1) . 

Theorem 3. When a £ (0,1) , as N ^ oo, Xk = limAr_i.oo aj^,^"* exists and is a 
discrete-time Poisson-Dirichlet{a, ~l3) 'E— coalescent. 

Proof: For the values of (3 for which it makes sense, we have 

When A^ is large, only the small A approximation of E (e^'^'^^) to the latter integral 
contributes. For small A, we have 



/oo 1*00 
^-(a+i)g-A.^^ ^l^aj (l - 



dx 



OO 



1-a I (1 - e-^-) ^ 1 - r (1 - a) A" ^ .-r(i-a)A° 







^The technique we use is inspired from the one used in [7] in a particular case. The author is 
indebted to B. Derrida for pointing this out to him. 
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Note that, when A is small 

E (Xje-^^i) = (-1)^ — E (e-^^^) ^ aV (i - a) A"^\ 

Thus, 

With the change of variables u — NT (1 — a) A", with (3 < a, we get 

Finally, using the above large N estimate of E (^at) i the identity T [x + 1) = xT (x) 
and Q with i — 2, when /? < a < 1, we obtain 

^^,^ ^(^^-") r(2-a)r(a-/3) _ ^ r(l-/3)(l-f) r(2-a)r(a-^) 



E 



r(2-/3) r(i-(/3-a))r(i-«) r(2-/3) 



r(i-^) r(2-Q)r(Q;-/3) _ i - a _^ 

Coo • 



r{a- P)T{l-a) r(2-/3) 1-/3 

Thus the coalescence probability cjv converges to Coo £ (0,1)- When a G (0,1), 
using again (|4]), we obtain more generally 

pW . p. _ r(i-/3)r(»-a) 
iv->oo r(i-a)r(i-/3)' 

which are the probabilities to merge all i particles in one step in the limiting discrete- 
time coalescent. 



To derive the full transition probabilities of the limiting discrete-time-fc coalescent 



Xk limAT^oo ^i^"*: recalling that 



J \ 1 /-oo 3 



(9) E(^I]^n5rj=^,^^ dX-X^-''-'X{^{X- 

j 3 nOO 



-Nr{l-a)X° 

1=1 -^0 

Performing again the change of variables u — NT (1 — a) A", with /? < a, we get 

E (^'n fl ) - «-^iv^/"-r (1 - af-' 7(7-^"^ n r . 

Using (^) ^ /j\ for large TV, we finally get -P/^'' ^ -Pi,j where (1 < j < i) 

(10) r ^'a'-i ro--/3/a) ^ A r(»,-a) 
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These are the full transition probabilities of the limiting discrete-tinie-/c coalescent 
Xk in the regime a € (0, 1) and (3 < a (satisfying — !)• Note that Pi i are 

the probabilities obtained previously and that the diagonal terms (the eigenvalues 
of P) read 

^ r{l-(3) r{t-(3/a) 

r(i-/3/a) r(z-/3) • 

Clearly this discrete-time coalescent is not a discrete A— coalescent as simultaneous 
multiple collisions can occur (it is a S— coalescent). Clearly Pij is also 

Pu = ^ E (^ ; )0,(*i,. 

where the 0^ (ii, .., ij)s define the probabilities of a (ii, .., ij) —merger (zi + .. + ij — 
i). These (pj (zi,..,ij), which can be read from (jlOl) . may be written under the 
alternative form 

, .X ,-1 r(l-/3) r{j-l3/a) ^ T{ii-a) 



where 

_-rr r((/-i)Q + i-/3) 
llr(i-a)r(/a-/3)- 

Thus (Uni) is also 

(11) p.,-c,,^^,-^^^—^ E 11 

Defining the finite Dirichlet measures Aj with density on the (j + 1) — simplex Aj 
given by: 

- ■ V aj-B-l 



"^"^ i=l \ 1=1 J 

we get 

(ii, .., ij) = Coo / ]^iiJ'"^Aj (dui, . 
•^-^j i=i 

The set of finite Dirichlet measures Aj with parameters 

(01=2- a, e,=2- a, 6j+i = aj - /3) 

on the simplices Aj completely characterize this limiting discrete-time coalescent. 
Note that Ai is a beta(2 — a, a — /3) probability measure. 

One may rewrite the ^^s as (see [37] and [31]) 



ki .. . . ,kj l—l 



PARETO GENEALOGIES IN A POISSON EVOLUTION MODEL WITH SELECTION 13 



where A — .., ui, ..) : ui > .. > ui > .. > : J2i>i '^i — l} ^^'^ " ^ mea- 

sure on A. Letting v {du) := S {du) / {u, u) , the measure v on the infinite simplex A 
can be identified (see [31]) to the two-parameter Poisson-Dirichlet(Q;, — /3) measure, 
with a € [0, 1) and (3 < a. It holds that v (A) = 1. Poisson-Dirichlet measures 
enjoy many remarkable properties including a stick-breaking property, 1351 . o 

Remarks: 

(i) From pT|) . the limiting situation a = also makes sense, leading to the one- 
parameter Poisson-Dirichlet (0, —(3) measure with P < a = 0. In this case, fi:om 

m 

r{i-i3) 

where Sij := 4j +ij=iY\l=i Jl ^'^s ^^'^ absolute first kind Stirling numbers. 



(m) Finally, avoiding size-biasing (/? = 0) gives rise to the discrete Poisson-Dirichlet (a, 0) 
coalescent with one-parameter a £ (0, 1) , appearing in [39j. 

• The case a G (1, 2) . 

Lemma 4. When a G (1, 2) , with ji :— ^j^j' /"'^ all l3 < a 

(12) CAT :=Pi^^ -a^-"B(2-a,a-^)A^~("-i) ^ 0. 

Proof: In this case, defining a a — 1 G (0,1), after an integration by parts and 
using the previous estimate ([8|) substituting a to a 

E (e-^^0 = - -« / x-^^+'^e~^^dx ^ e"^ - - (1 - F (1 - a) A'^) . 
a Ji a 

Thus, for small A, with ii :~ — ^ 

(13) E (e-^^i) - 1 - A/1 - F (1 - a) A" - e-^^. 
Thus, consistently with the GCLT approach, to the dominant order 

so that 

CN ■■= Na^^ (2 - a, a - /3) - a^'^B (2 - a, a - /3) N~^°'~^\ o 

This suggests that 

Proposition 5. When a G (1,2), upon scaling time using an effective popula- 
tion size Ne ~ (with cn as in we obtain a limiting continuous-time-t 
K— coalescent: Xt = limTv-s-oo a^j^^^^j, with A a beta(2 ~ a,a — (3) probability mea- 
sure, f3 < a. If l3 = 0, we get the standard beta(2 — a,a) coalescent. 
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Proof: To confirm this point, we will first evaluate a large N estimate of Pj;^^ 
defined in in the range a G (1, 2) . Using the above small A estimate of E (e^^^^) 
in the parameter range under concern, for i > 2, we get 

E (Xje-^^i) = (-1)* ^E (e-^^^) ^ aV [i - a) A"-\ 
dX 

Thus, using ^ 

,7V-1 



E 



Finally, we obtain 



showing that, with cn — -P^^^ °^ ^ ^" — > 0, for each i, liniTv^oo Cj^P^^'^ exist 
and are strictly positive constants. More precisely, 

where Ai A is a beta(2 — a, a — /3) probability measure. 

To deal with the higher order terms, with m :— {I {1, j} : ii > 2}, assuming 
1 < J < let us write ^ as 

vj;,eiW.,^...^e_,^,„U...U e(i]^,) 

where the double-star sum is now over the ii, I — 1, m satisfying ii > 2. Proceed- 
ing similarly with higher order terms, it clearly holds that for all j > 2 and i; > 1, 
I = 1, j, satisfying > 2 for at least two I in the list. 



Jim c^^E (^E^n^rj/E(s^)=0, 

so that simultaneous multiple collisions cannot occur in the limit (actually the con- 
tribution of these simultaneous multiple collisions terms in P}^'^ is of order O (c^))- 
In the latter expression of P^^"^ therefore, only the term corresponding to m = 1 
wiU contribute to the O (cjv) -order. Because, E (Xie^^^^) = --^jE (e~^^i) - ^ 
and E {Xl^e'^^^) ~ aT {h - a) A""*^ with ii = i - j + 1 > 2, using (0), we get 

^ L= dX-y-^~^E(XYe-^^^) — ^ !^ '- 

afi^'^Tjii - a) r°° -jv^A ^ QA^^'^r (h - a) T {a - P + j - 1) 



E 

E 
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This shows, using (j ) (fT2|) and after some elementary algebra, that for 

j = - 1, 

-ip(iV) , ^ ( ^ \ B{i- j + \- a,a - P + i - I) 



where A = Ai ~ beta(2— a, a— /?). As a result, xt = limAr_^oo ^[t/lj^] is a continuous- 
time-t pure death coalescent process on N with infinitesimal transition rates Xij . o 

• The case a = 1. We view it as a limiting case of the previous analysis deriving 
from (1131) when a — > 1"*". 



Proposition 6. When a = 1, upon scaling time using a logarithmic effective popu- 
lation size Ne — c'^ ^ logiV, the limiting process Xt = limAr_j.oo ^[t/i^] exists and is 
a continuous-time-t K~ coalescent with A a beta(l, probability measure, /3 < 1. 

If f3 — 0, we get the standard Bolthausen-Sznitman coalescent with A uniform. 

Proof: Put indeed a = 1 + e in the previous small-A estimate of E (e^'^'^i) 
in the parameter range a G (1, 2), with e > small. Then /i ^ 1 + 1/e and because 

r(l-a) = r(-e) -0+ -i/e, 

(14) E {e-^^^) - 1 - A - i (A - A^+'') - 1 - A + AlogA / (A) . 



Thus, 



E 



The leading contribution of /(A)^ is, when A is small, of order l/(A^logA^). 
Putting A = It/ (TV log TV) 

I{Xf ^e-^[l + (log u - log log N-l) 

Thus, with P <l 



E 



(15) -(TVlogTV)^ (^l + ^^(7A(l-/3)-loglogiV-l) 

where ip {x) = F' [x) /F (x) is the digamma function. In the latter estimate, we 
used that differentiating Jq°° du-u~^u^e~^ = F (1 — /3 + 6*) with respect to the extra 
parameter 9 and then putting 9 — gives du ■ log [u) e^" — T' [1 — (5) . 

Finally, consistently with the GCLT approach 



CN = N- ^ ' 



1 



E 



log 
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Clearly, with A a beta(l, 1 — /3) probability measure and (3 < a ~ 1 
-IdW f' ^-2^ , . ^ r(2-/?)r(i-l) ^ 

Using similar arguments on higher order terms, showing that simultaneous multiple 
collisions do not contribute in the limit, one can easily show 

(16) c],'P,^P ^ f u^~^-' (1 - uf-' A {du) 

•Aj ,j J j 1, j — 1, 







N- 

B{i-j,3-P) 
5(1,1-/3) 



where A = Ai ~ beta(l, 1 — j3). This confirms that, when a = 1, upon scaling time 
using an effective logarithmic population size — ^ log we get a limiting 

continuous-time-t A— coalescent xt = limAr_).oo Xj^^^^j, where A is a beta(l, 1 — /3) 
probability measure, /3 < 1. o 

• The case a > 2. 

Proposition 7. When a > 2, for any value of j3, with ji :— ^j^y > and p 
-^^^ > 0, upon scaling time using a linear effective population size = = 

Nfi^/p, the limiting process Xt = limjv^oo ^[t/lj^] exists and is the continuous-time- 
t Kingman coalescent. 

Proof: If a > 2, Sat is in the domain of attraction of the normal law. As a result, 
for smaU A, with := E (ATi) = ^ and p := E (Xf) = ^ 

(17) E(e-^^i) -l-A^+ipA'-e-^'^. 

From (dZl), for small A, E(A:ie-^-^i) = --^E{e-^^^) - a* and 'E^Xfe-^^^) = 

We have cat ;= R^^^ = jV ^^^"/p with, from © 
E ' " 

E 

Thus, 

goes to as claimed. As in Proposition 5, simultaneous multiple collisions cannot 
contribute in the limit. Only the term in the expression of -P/^^ corresponding to 
m — 1 will contribute to the O (c^r) —order and so we need to focus on a collision 
with a single ii > 2. 

Now, from (H?]), E(A:ie-^-^i) = -^E(e-^^i) - n and i:, {Xl^ e~^^^) - p6^,^2 
with ii = i — j + 1 > 2. Proceeding again as in Proposition 5 therefore, only 
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effective transitions from i to j < i with j = i — ii + l = i — I arc seen in the Hmit, 
corresponding to the binary mergers of a Kingman coalescent. <> 



• a = 2. To complete the picture, it remains to study the limiting critical case 
a = 2. 

Lemma 8. When a — 2, for all values of /3, the coalescence probability goes to 
as N ^ oo like 

1 logA^ 



cn 



2 N 



Proof: We view the case a = 2 as a limiting case of p7)) as a — ?> 2+. When A is 
small and for a > 2, we have 

E (e-^-^i) - 1 - A^i + ipA^ - r (1 - a) A". 

Putting a = 2 + e, for e > small, using F (—1 — e) ~o+ V^i '^ith p ^ 2/e, when 
A is small, we have 

(18) E (e-^^') r^l-2X+-X^ - -\^+'' - 1 - 2A - A^ log A - e"^^. 

Because E (Xig-^^i) = -^jE (e-^^i) - 2 and E (X^g-^^i) = ^E (e-^^i) - 
—2 log A — 3, we have 



E 



(S^^f) - j,^]|°°dA.A-^E(X^--OE(e 

/ dA-Ai-'^(21ogA + 3)e-2^- 



El 



r(2 






1 


r(2 




1 


/ 



(^^) ^ f?^/ . A-''- V^^^ = (2iV/ 



r(-/3)7o 

The Euler integral with a logarithmic term inside appearing in the expression of 
E ^E^S"^^ can be obtained while taking the derivative of c?A- A^~'^A^e"^^'^ with 
respect to the extra parameter 6 and then putting = in the obtained expression. 
Observing therefore 



d\-X^-f^ log (A) e-2W^ = r' (2 - /3) {2N)-^'^-^^-T (2 - p) {2N)-^'^-^^ log {2N) , 

to the dominant order in iV, 

(S^^f) ^2(2iV)-(2-^hog7V, 



E 

leading to 



" -i-A / fl \ 1 AT" 

E 



2 iV 



Remark: A similar scaling behavior for cat was recently obtained in Theorem 2.4 of 
[20| , dealing with coalescents arising from compound Poisson discrete reproduction 
models, in the critical case. 
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Proposition 9. When a — 2, upon scaling time using an effective population 
size Ne — c'^ — {2N) / log N, the limiting process xt = limTv^oo ^[^/Ij^] is the 
continuous-time-t Kingman coalescent. 

Proof: Using (US]), for small A, we have 

d\ 

to which one should add E {Xie"^^^) - 2 and E {Xle-^^^) (2 log A + 3) . For 

i > 3, we get 

^ H;(i-|f dA.A-V- = 2^1i^^. 



E 



Thus, for all i > 3, as — ?> cxD 



1 



Therefore, due to the extra factor log N appearing in cjv , the transitions from i > 3 
to 1 cannot be seen in the limit, nor (for the same reason) the transitions involving a 
single multiple collision with ii > 3, nor transitions involving simultaneous multiple 
collisions of any order. In fact, only the events involving a single multiple collision 
with ii ~ i — j + 1 = 2 (corresponding to transitions from i to j = i — 1) will 
contribute in the limit. Indeed, 



E 



F.H) ^" V 1 ; E(i]^)r(z-/3) 

^ / dA-A'-'^-i(21ogA + 3)e-2^'^ - -iV-MogiV. 



E 

This shows, using 



1 \2 



E 



with j = i — 1, (^) ^ and after some elementary algebra, that 

c^v.'»>^^^A.,._..(; 

Remarks: 

(i) Results of a similar flavor can be found in Schweinsberg's work j39j. However, 
his model and techniques are different from ours because he considers large— 
limiting coalescents obtained while sampling without replacement from a discrete 
super-critical Galton- Watson branching process, assuming the reproduction law of 
each offspring to exhibit power- law Zipf tails of index a (a in his notations) . Note 
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that there is no parameter /3 in the construction i39i . In the same spirit, results 
can also be found in Huillet-Mohle [19] where, following [13], A— coalescents are ob- 
tained as scaling limits of discrete extended Moran models, the skewed reproduction 
law of which displaying occasional extreme events with one individual allowed to 
produce a large amount of offspring. Whenever the reproduction law displays sys- 
tematic extreme events, discrete coalescents were even shown to emerge in the large 
A^— limit, but in this Moran context, they are only A— coalescents with multiple but 
no simultaneous collisions, |19] . This contrasts with the occurrence in our present 
work of discrete Poisson-Dirichlet S— coalescents. In contrast also with our current 
work where coalescents are derived from a sampling procedure in the continuum, 
the coalescents considered in [39] and [19] were built from discrete reproduction 
laws at fixed population size N and looking at their scaling limits N oo. 

We refer to these two works and to [7] for additional background on A— coalesccnt 
processes adapted to our purposes. 

(ii) When a > 2, neither the (large N estimate of the) scaling constant nor the 
limiting (Kingman) coalescent depend on the bias parameter /3. 

(in) We observe that the probability cn -P^^'* that two individuals chosen at 
random share the same common ancestor, defining the time-scale to derive the large- 
A^ limits of the Pareto-coalescents all have the same large- A^ order of magnitude as 
the length Zjv of an external branch chosen at random in the limiting A— coalescents 
Xt {a G [1)2)) or coalescent Xk {a € (0, 1)), or Kingman coalescent xt (a > 2), 
started at Xq = N. This curious fact is unexplained so far. 



5. Forward in time selection model and genealogies 

In this Section, in the spirit of [7], we indicate that the coalescent processes just 
discussed may be viewed as the genealogical processes of some forward in time 
evolving branching population models with selection. As it is often the case in 
population genetics, the process we are interested in is in the class of branching 
processes conditioned on having a fixed population size over each generation, in the 
spirit of [25] . 

5.1. A Poisson-point process model with selection. Start with A^ individuals 
at generation t = and assume that each individual has an initial fitness Xn (0) > 0, 
n= 1,...,N. 

To describe the state of the population at the next generation, assume first that, 
independently of one another, each individual potentially generates an infinite num- 
ber of offspring along a Poisson point process (PPP) with intensity (or occupation) 
density 

(19) ^,„(o) (x) = -7f;„(o) (x) = axn (0)" 



■^The ' symbol indicates derivative with respect to x. 



20 



THIERRY E. HUILLET 



where Trx„(o) i^) ■— {x/xn (0)) x > 0, a > 0, n = 1, ..,N. We observe that, with 
7f (x) := x"" 

- / N T^{x) 

TT [Xn (0)) 

and if we let tt (x) = — tt' {x) — ax~^°'~^^\ then 7ra;^(o) {x) = Xn (0)" tt (x) . 

So the fitnesses of the offspring of each individual is generated according to a PPP 
depending on the fitness of its parent. From these simple assumptions, and as 
conventional wisdom suggests, we get: 

Proposition 10. In a PPP model for fitness-dependent offspring reproduction with 
occupation density m9\) . the fitnesses of the offspring of some parental individual 
with fitness x„ (0) are a;„ (0) times the fitnesses of the offspring of some canonical 
individual with unit fitness: In this sense, the larger the fitness x„ (0) of some 
individual is, the more he will, proportionally, produce offspring with large fitness. 

Proof: Let (r„; n > 1) be the points of a standard homogeneous Poisson point pro- 
cess (PPP) on the half-line with rate 1, and let tF"^ (s) — s~^^" be the decreasing 

inverse of W. Then, with ^^^^(Q-f (s) = a;„ (0) • s^"'^/", (^^^(^q-^ (tu) ',n>lj are the 
(ordered) points of the offspring PPP on the positive half-line (or here the fitness 
space) with occupation density tTj-^jq)- So, the fitter the individuals, the fitter their 
offspring, proportionally to the parental fitness, o 

Let us also briefly emphasize that, avoiding the fitness dependence on the parent of 
the PPP, (tt"^ (r„) ;n > 1) are just the (ordered) points of a PPP on the half-line 
with occupation density tt. Whenever, as in our case study here, the rate function 
TT is not integrable up to a: = 0, there are infinitely many such points, with as 
an accumulation point whereas there is of course a finite Poisson (with mean tt (e)) 
number of them above some threshold e > 0. 

ft is well-known that when a G (0, 1), with W (x) :— and (s) = s^^/"', the 
positive cumulative rv 

(20) x = J2^-'{rn) 

n>l 

is a one-sided a— stable rv on (0, oo) with LST E (e^'^^) = e""'^" , k = T {1 — a) > 
0, A > 0. 

When a G (1,2), the law of x is the one of a positive Lamperti rv with LST 
E (e-^x) ^ g-cA+«A° ^ K = -r (1 - a) > 0, c := E (x) > 0, f27. When a = 1, the 
law of X is the one of a positive Neveu rv with LST E (e"'*''^) = e'^'°S'^ (see [32]); 
the latter may be viewed as a Lamperti rv in the limit a — ^ 1^, [18) . 

Typically indeed, by the Levy-Khintchine formula, tt (x) dx stands for the Levy 
measure for the non-negative jumps of x which is the value at time t = 1 of an 
infinitely divisible subordinator (xut > 0) with LST E(e"^'^)*, [2]. In the 
terms 7f~^ (r„) are thus the ranked jumps of x in its Levy decomposition (see [33] 
for example). 
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Reproduction step. 

Because we consider the offspring of all the N initial individuals with fitnesses 
Xn (0) , n — 1, N , the step— 1 state of the whole population is thus obtained from 
a PPP with global equivalent occupation density 

N 

(21) 7r,„ „(o) (x) = ^ x„ (0)" = axN,a (0)" x~^"+'\ 

n=l 

where 

is the global equivalent initial fitness of the whole population at generation to 
consider. We get 

Proposition 11. In a PPP with occupation density for the descent of each 
individual with fitness a;„ (0) , n = 1, N , the occupation density of the population 

as a whole is given by Ii21\) . where XN,a{0) '■= ( X]n=i (0) ) is the global 
equivalent fitness. 

Proof: This follows from the superposition principle of Poisson point processes 
(see [25] p. 16). The fact that the intensity of the superposed PPP is in the same 
class as the one of a single PPP descending from x„ (0) is a remarkable scaling 
property of tt^^i^q) (x) ■ o 

In this setup therefore, tt^;^ ^(o) (a;) :— — 7r^„ ^(o) {x) stands for the occupation den- 
sity that there is a point at x descending from any of the N initial individuals with 
fitnesses Xn [0] , n — 1, N. 

Note that the cumulated fitness of all first-generation ofi'spring is 

Y.^'^nAO) i^n)^XN,a (0) -X, 
n>l 

where x is given by (^0]) . 
Selection step. 

In order to model a population with fixed size over the generations, the final state of 
the population at time 1 is obtained while selecting the N individuals of the whole 
population whose fitnesses are the largest (the selection step), truncating therefore 
the latter sum to its N first terms. 

The whole process (including reproduction and selection steps) is then iterated 
independently over the next generations. 

From this definition of the process, if the x„ (fc)s are the fitnesses of the N fittest 
individuals at generation fc, the ordered ones X(„) (fc + 1) at generation k + 1 (x(i) > 
... > a;(Ar)) descending from the whole population at step k are given by X(n) = 
7r~^ (fc) ('''") ~ XN,a (k) Tn^^" . Here the t„s are the ordered points of a standard 
Poisson process on the half-line with ri < ... < r„ < ... < r^v- The law of t„ is thus 
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the one of an Erlang ganima(n) rv with density /r„ (s) = s" /T (n) . Further, 
given tn+1 = s, the probabihty density of ti, ...,tn is 

iV! 

/ri,...,TN (Sli Sat I T AT+i = s) = — jv lo<si<...<SN<s 

and so 

/•OO 

/ti,...,tjv (si) --J SAf) = lo<si<...<sjv / f^Se"" = e^'*"lo<Sl<...<s„• 
JsJv 

The joint law of the ordered x„ (fc + l)s is thus the one of the images tt^^ (■'■n)s, 
namely 

/x(i)(fc+l),...,2.(„)(fc+l) (a;i,..,XAr) = e"''"">°"='^"'"' J| 1"a;iv,c(fe) (2^") '^xi>...>XN- 

n=l 

Clearly also (with the two terms in the right-hand side term mutually independent), 

X(N+1) (fc + 1) = XN,a (k) xlf^i (fc + 1) , 

where x*^_^_i {k + 1) is the (A^ + 1) — st largest point of a PPP with occupation 
density tt (x) — ax~^°'^^\ x,a > 0. By the image measure theorem, the density of 
^iv+i + 1) = T^^^ (tn+i) is obtained as a power-gamma density 

(22) (x) = X > 0. 

Next, the conditional density of each Xn {k + 1) given X(jv+i) (fc + 1) = x is 

fx„(k+l) {xn I X(Nj^i) {k+l)^x) = ^^^^lx„>x, 

^ ' ^ ' n (X) 

showing that (with the two terms in the right-hand side term mutually indepen- 
dent), Xn (fc + 1) = 2^(^+1) (fc + 1) Xn {k + 1) where X„ {k + 1) is a Pareto(Q;) dis- 
tributed rv with density / {x) = ax^^°'^^'' on (1, 00 jl. Putting all this together, we 
obtained 

Proposition 12. Independently for each k > 0, with x*^^^^-^ {k -\- 1) having the 
power- gamma distribution and Xn {k + 1) being Pareto{a) distributed and with 
the three right-hand side terms being mutually independent, 

(23) Xn (fc + 1) ^ XN^^ (fc) (fc + 1) X„ {k + I) , n ^ 1, N, 

indicates how to update multiplicatively the fitness of the n—th individual at genera- 
tion fc-f 1, when the fitnesses of the previous generation are summarized in x^^a (k) ■ 

We also clearly have an update of the global equivalent fitness XN^a from step k to 
step fc -t- 1 as: 

AT \ 1/" 



Corollary 13. With x^^a (k) yJ2n=i ^" (^)" j global equivalent fitness 

of the whole population at generation k, the following recursion holds 

/ N \ 

(24) 

XN,a 

[k+l) (fc) (fc + l)^x„(fc + l)" . 



We used the scaling property of Pareto(a) rvs X on (1, 00) stating that X \ X > a = aX. 
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In the latter sum term, each is thus a Pareto(l) distributed rv with density 
/ {x) — x^^ on (1, oo) and we need to sum N of them independently which is rem- 
iniscent of 

Large TV asymptotics of the a— mean fitness. 
Defining 



to be the generalized (Holder) a— mean of the fitnesses Xn, n = 1, ...,iV, at genera- 
tion k, it follows from (l23l) that 



(25) (x)^^(fc + l)i(x)^_„(fc)x^+i(fc + l) (^^X„(fc + l)"j . 

By Jensen inequality, the Holder a— means [x) ^ (A:) are non-decreasing functions 
of a. 

Corollary 14. The fi—moments of {x)j^ ^ at generation k are given by 



E 



Proof: This follows from the independence of the a;^^]^s and the X!^ and their 
i.i.d. character within each generation k and from (j25p . o 

Proposition 15. For large N, with :— log log iV 

T log (a;)jv,a (fc) ^m- 

With 



Fn W) log log TV - — ^ {iP (1 - /3/a) - log log TV - 1) 

a a log TV 



and /at (a) , a < 0, iis Legendre transform, the large deviation regime is given by 
^ logP (-^ log {x)^^^ (fc) ^ a") ^->^ fN (a) < 0. 



Proof: Observing that E fx*^, , ) = T (TV -I- 1 - /3/a) /F (TV + 1) - AT-/?/" and 

applying (jisp giving the moments of a partial sum of N i.i.d. Pareto(l) distributed 
rvs, it follows that, with (3 < a 



As a result, we get 



(X)„„(0)'^ large ^ V « log TV ' ^ ^ 
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Thus, for large iV, with 

Fn W) = loglogiV - — ^ (iPil- p/a) - log log iV - 1) 
a a log A* 

defining the concave thermodynamical 'pressure', 

-ilogE((x)^„(fcf) ^ F^iP). 
Thus, with a — Fjy {(3) < 0, by the large deviation principle 

(26) ^ logP (^-i log {x)j,^^ (fc) ^ ^->^ /jv (a) < 0, 

where /tv (a) — inf^<Q, (a/3 — i^Ar (/?)) is the concave Legendre transform of Fn, 
giving the large deviation rate function of —\og{x)j^^{k)/k. In particular, for 
large TV, with log log iV 

(27) \\og{x)^^{k) -F'^{Q)^-VN, 

gives the limiting right shift of the Holder a— mean fitness induced by selection 
effects, o 



Remarks: 

{i) The limiting right-hand-side term in (1271) . although increasing very slowly with 
N , does not stabilize to a limit, in contrast to other similar models [Tj of branching 
with selection where, in each generation, each individual produces only two off- 
spring with randomly shifted fitnesses. 



[ii) With a > 0, let / {x) := > define some (increasing) output map of the 
individuals fitnesses x, with x > 0. Defining 

N N 
n— 1 n— 1 

to be the mean output fitness in generation k of the whole population, then, what- 
ever a, (|?7)) is also 

il0g(/(x)}^ (fc) "4- VN, 

f\i K — y oo 

interpreting the speed vn itself. This suggests that it is of interest to work not 
only on the fitnesses x^ themselves (and their a— mean {x)j^ ^) but rather on some 
deformed version of the fitnesses x" (and their standard mean (/ (x))^). Clearly 
(/ (^))a' itself obeys the recursion 

N 

if (x))^ (fc + 1) = (/ (x))^ (fc) • (fc + 1)" • E ^" (fc + 1)" ' 

n=l 

deriving again from ()23p and the definition of the equivalent global fitness XN^a (k). 
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(in) Note finally that, given XN,a (k), the cumulative distribution function (cdf) of 
the fitness (fc + 1) of the fittest individual among the N individuals at genera- 
tion /c + 1 is given by 

P.„,„(,) ik + l)<x)=P (n) < x) = e-^^/^-^e^))-", 

where ti is exp(l) distributed. Thus the fitness of the fittest individual obeys 

x^if ik + l)^XN,c {k)Yik + l), 

where Y (fc), fc > is a sequence of i.i.d. Frechet rvs with cdf P (y < a;) = e^^ 
The conditional mean given XN^a (k) of (fc + 1) is XN^a (k) T (1 — and its 
median value XN^a (fc) (log 2) . More generally, when N is large and uncondi- 
tionally, due to the recursion (|24l) on the XN,a (fc)s: 

E (fc + 1)^) ^ XN,o. [of r (1 - /3/a) e-'=^"(«. 

5.2. Genealogies. Now we turn to the genealogies of this branching process with 
selection. 

The beta(l, 1-/3) and Bolthausen-Sznitman coalescents. 

Recall that Tr^j^ ^(k) (x) :— —t^^n (fc) occupation density that there would 

be a point (an offspring) of the PPP at position (with fitness) x at generation fc + 1, 
given a global population state XN,a (fc) ■ 

Proposition 16. Looking backward in time, upon scaling time using cn 1/ \ogN , 
the genealogy of the branching model with selection is a beta{l,l — (3) coalescent, 
reducing to the Bolthausen-Sznitman coalescent if /3 = 0. 

Proof: Suppose first (3 = 0. Given there is an offspring at x at generation fc -I- 1, 
the sampling probability that it would be an offspring of the individual with fitness 
Xn (fc) is thus 

T^x„ik)ix) _ aa:„ (fc)" _ x„ (fc)" 



'^XN.^{k){x) aXN,a{k)" X XN,a{k)°" 

which is independent of x. Observing from (|23)) and (p4)) that 

Xn [k) = XN,a {k ~ 1) X^+i (fc) X„ (fc) , n= 1, N, 

/ N \ 1/" 

XN,a (fc) = XN,a (fc - 1) 2:^+1 (fc) ^ (fc)" , 

this random probability is also 

Xn (fc)" 



where, for each fc independently, the X„s are i.i.d. Pareto(Q!) distributed on (1, oo) . 
Because in each generation, parents generate offspring independently and indepen- 
dently of one another, upon averaging, the probability that, at generation fc + 1, i 
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individuals share the same common ancestor is independent of k, with 



N 



n=l 



The Xn (fc)s being i.i.d. Pareto(a) distributed rvs, the X„ (fc)"s are i.i.d. Pareto(l) 
distributed rvs and we are thus back to the resuhs of Proposition 6 with (3 = 0, 
stating that with ^ l/\ogN, 

w . /\,^-2A r(2)r(z-i) _ 1 



r(i) i-i' 



where A '^beta(l, 1) , uniform. We can proceed similarly to derive the probabilities 
Pi'J^ that i individuals have j < i parents, behaving consistently with ([T6|) with 
/? = 0. We conclude that, whatever a, in the large N limit, the time-scaled ge- 
nealogy of the branching model with selection is a Bolthausen-Sznitman coalescent 
process, obtained while sampling from N Pareto(l) i.i.d. rvs. Would the sampling 
probabilities Pj;j ^ include a /3— size biasing effect on total length Eat := J2n=i ' 
the genealogy of this branching model with selection would be a full beta(l, 1 — /3) 
coalescent, provided /3 < 1. While adopting this sampling point of view to compute 
the coalescence and merging probabilities, we therefore obtain a limiting genealog- 
ical coalescent process which is independent of a. o 



Genealogies from the output PPP. 

What now if we set that the occupation density that, at generation fc + 1, there is 
an offspring at x descending from some individual with fitness x„ (k) at generation 
fc, is instead given by 

(28) T^x^(k) (x) Xn (fc) x"^, 

while distorting the original occupation intensity TTx„{k) {x) = ctXn (fc)" a;"^"^"'^'? 

Then the occupation density that, at generation fc + 1, there is an offspring at x 
descending from any individual of the whole population would take the form 

7rx„,i(fc) (x) := XN,i {k)x^^, 

where xn,i (fc) '■= Xn (fc) is the cumulative fitness in generation fc. 

If this were to be the case, given there is an offspring at x at time fc + 1, the 
sampling probability that it is an offspring of the individual with fitness x„ (fc) 
would be, thanks to (l23l) and (|24|) with a = 1 : 

^2g^ 7^x„(fc) jx) _ Xn (fc) _ Xn (fc) Xn (fc) 

1":riV,lW (^) XNAik)x-^ XN,l{k) ^^=1 ^" (fc) ' 

again independently of x. This probability now involves a normalized sum of the 
XnS, which are i.i.d. Pareto(a) distributed and the strategy to compute the merging 
probabilities of the ancestral process will be modified. 
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Under this hypothesis indeed, the probabiUty that, at generation fc + 1, i individuals 
share the same common ancestor reads 

where 5*1 is the normahzed segment size now obtained from N i.i.d. Pareto(a) 
distributed rvs, normahzed by their sum Eat (k) :— J2n=i -^n (k). We can proceed 
similarly to derive the probabilities Pi^"^ that i individuals have j < i parents and 
we are back to the studies of Sections 2 — 4. And we can as well /3— size-bias these 
sampling probabilities on the total lengths Sat. We call this sampling procedure 
the distorted sampling procedure. 

Proposition 17. Looking backward in time, using a distorted size-biased sampling 
procedure, the genealogy of the branching model with selection is 

- a continuous-time Kingman coalescent if a > 2 (upon scaling time with cjq oc 1/iV 
if a > 2 or cn (X log N/N if a ^2). 

- a continuous-time beta(2 — a, a ^ /3) coalescent if a G (1,2) and j3 < a (upon 
scaling time with cn (x. N~^"^^^ ). 

- a continuous-time beta{\, 1 — (5) coalescent if a — 1 and /3 < 1 (upon scaling time 
with Cm oc l/logTVj. 

- a discrete-time Poisson-Dirichlet{a, —jS) coalescent if a & (0, 1) and j3 < a. 

Proof: It remains to interpret the distorted size-biased sampling procedure which 
is proposed to compute the merging probabilities of the ancestral process: Assume 
that the fitness dependent PPP describing the descent of an individual with fitness 
Xn (k) is now the output image of the original one, given by the canonical application 
fx^(k) {x) = {k) {x/xn (fc))" , fx^(k) ■ — > 1R+. Its gives risc to a new PPP with 
distorted intensity i^f^^^^^ (dx) — Xn (k) x^'^dx, the image measure of T^x„{k) (dx) = 
T^xnik) {^) dx = axn (fc)" by fx„{k) & result of the classical Campbell 

formula for PPPs, see [25] p. 28). This was the starting point in ((28|) . The 
skewed computation in (j29p of the probability that some offspring is descending from 
one individual with fitness Xn {k) is thus based not on the original PPP attached 
to Xn{k) but rather on a deformed version of it through fx„{k)- We note that 
fx„{k) (x), as a function of the two arguments (x„(fc),x) is homogeneous with 

/A"x„(fe) (a':e) = A'^(i-")+^"/,„(fe) (x), A > 0, leading obviously, if A = a:„ (fc)-l/^ 

b/a = 1 and / (a;) :— fi {x) = x" to: fx„{k) (x) = a;„ (k) f {x/xn (k)). The function 
/ (x) = x" is the output fitness function introduced in Subsection 5.1, Remark(M). 

Using this distorted size-biased sampling procedure therefore, following the intro- 
ductory arguments, the full class of the Pareto-coalescents (described in Sections 
2 — 4) are obtained. 

Suppose for instance a G (1,2), (3 = 0. Based on the previous computations of 
Sections 2 — 4, we conclude that the large N distorted genealogy of the branch- 
ing model with selection coincides (upon scaling time correspondingly: k — >■ [t/cN]) 
with a beta(2 — a, a) coalescent process, obtained while sampling from N Pareto(a) 
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i.i.d. rvs, normalized by their sum. Would this probability involve a /3— size bias- 
ing effect, the genealogy of this branching model with selection is identified to a 
beta(2 — a, a — /3) coalescent, (3 < a. 

If a S [0,1), /3 < a, the obtained large N genealogical coalescent will coincide 
with the discrete-time-fc Poisson-Dirichlet coalescent with parameters a and — /3. 
Only when a = 1 do we get as in [7] (upon scaling time logarithmically with N) 
the Bolthausen-Sznitman coalescent (J3 = 0) or more generally the beta(l, I — /3) 
coalescent, provided /3 < 1. o 
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